Spoken Dialogue System Using Prosody as Para-Linguistic Information
نویسندگان
چکیده
An attitude recognizer of a speaker which uses prosodic features of speech is proposed and it is successfully applied to the dialogue system aiming at agreement formation. We use not only linguistic information but also some sorts of additional information supporting linguistic information in our human communication. In agreement formation dialogues, we are often required to express our attitude (positive or negative) to conversational partners’ proposals. We sometimes reply explicitly in linguistic information. We sometimes reply information ambiguously. However, even in the ambiguous case, we implicitly express our attitude using prosodic information. By realizing the abilities of catching these nuances, the dialogue system can be more sophisticated. In this paper, we implemented an attitude recognizer based on the GMM using prosodic feature parameters. The performance of the system is comparable to the human ability. We also realized a proto-type of spoken dialogue system using the recognizer. We show how these abilities contribute to efficient conversation.
منابع مشابه
Prosody based attitude recognition with feature selection and its application to spoken dialog system as para-linguistic information
In this paper, prosody-based attitude recognition and its application to a spoken dialog system are proposed. Paralinguistic information plays a important role in the human communication. We aimed to recognize the user’s attitude by prosody, and apply it to a spoken dialog system as para-linguistic information. In order to find important features to recognize the attitude from automatically ext...
متن کاملA hybrid approach to spoken dialogue understanding: prosody, statistics and partial parsing
Linguistic processing in spoken dialogue systems has to be robust against a large number of phenomena such as recognizer errors, spontaneous speech phenomena and out-of-vocabulary (OOV) words. A commonly used solution to this problem is partial parsing, that aims at detecting only parts of sentences/utterances that are vital for the respective task of the parser. In our paper we present a frame...
متن کاملA framework of reply speech generation for concept-to-speech conversion in spoken dialogue systems
Due to recent advancements in speech technologies, a large number of spoken dialogue systems have been constructed. However, since most of them adopt existing text-to-speech synthesizers, it is rather difficult to reflect the linguistic information obtained during the reply sentence generation well in output speech. A framework is necessary for correctly reflecting higher-level linguistic infor...
متن کاملPersonal Statement for Gina - Anne Levow
My research is strongly interdisciplinary, drawing on methods from computer science to investigate fundamental linguistic questions and applying findings from linguistics to develop improved techniques for automatic computational understanding of natural language. My research lies at the intersection of computational linguistics, natural language processing (NLP), and spoken language processing...
متن کاملMessage-To-Speech: High Quality Speech Generation For Messaging And Dialogue Systems
In this paper, we present a Message-toSpeech (MTS) system that offers the linguistic flexibility desired for spoken dialogue and message generating systems. The use of prosody transplantation and special purpose prosody models results in highly natural prosody for the synthesised speech.
متن کامل